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Q_i Abstract 

We study various error exponents in a binary hypothesis testing problem and 
extend recent results on the quantum Chernoff and Hoeffding bounds for product 
states to a setting when both the null-hypothesis and the counter-hypothesis can 
1-1 be correlated states on a spin chain. Our results apply to states satisfying a certain 

factorization property; typical examples are the global Gibbs states of translation- 
invariant finite-range interactions as well as certain finitely correlated states. 
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1 Introduction 

We study the asymptotics of the error probabilities in a binary hypothesis testing problem 
for quantum systems. In a rather general setting (used generally in the information- 
spectrum approach US]), one can consider a sequence of finite-level quantum systems 
with (finite-dimensional) Hilbert spaces 7i = {Tin}^^. Assume that we know a priori 
that the nth system is in state p n (null-hypothesis H ) or in state a n (counter-hypothesis 
Hi). The hypothesis testing problem for the nth system is to decide between the above 
two options, based on the outcome of a binary measurement on the system. 

A measurement in our setting means a binary positive operator valued measure {T n , I n — 
T n } where < T n < I n corresponds to outcome and I n — T n to outcome 1. If the outcome 
of the measurement is (resp. 1) then hypothesis H (resp. Hi) is accepted. Obviously 
we can identify the measurement with the single operator T n . An erroneous decision is 
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made if Hi (H ) is accepted when the true state of the system is p n (cr n ); the probabilities 
of these events are the error probabilities of the first (second) kinds, given by 

a n (T n ) := p n (I n - T n ) = Trp n (/ n - T n ) and (3 n (T n ) := a n (T n ) = Tr cr n T n , 

respectively. (Here Co denotes the density operator of a state u, given by u(A) = 
TtujA, A e B{H n ).) 

Apart from the trivial case when supp p n _L suppa n , one cannot find a measurement 
making both error probabilities to vanish; in general, there is a tradeoff between the two. 
In the general cases of interest the error probabilities are expected to tend to zero asymp- 
totically (typically with an exponential speed) when the measurements T n are chosen in 
an optimal way. In the asymmetric setting of Stein's lemma [5j [12] the exponential decay 
of the /3 n 's is studied either under the constraint that the a n 's tend to 0, or that the a n 's 
stay under a constant bound. As it was shown in [T2] and [2T], in the i.i.d. case (i.e. 
when Ti n = TCf n , p n = pf n and o n = <rf n ) the optimal exponential decay rate is given by 
—S (pi || <Ti), the negative relative entropy of p\ and <7i, thus giving an operational inter- 
pretation to relative entropy. This result was later extended to cases when the sequences 
p := {p n }^ =1 and a := {o r n }^ 1 consist of restrictions of an ergodic state p and a shift- 
invariant product state a on a spin chain [TBI 15]. In the symmetric setting of the Chernoff 
bound [U El EO] the exponential decay of the average of the two error probabilities is of 
interest. As it was shown in [1] and [20], the best exponential decay rate in the i.i.d. case 
is given by — C (pi, a±), with 

C( Pl , <t0 := - miB Ms) := logTr p{a\- . (1) 

0<s<l 

The above result shows that the quantity C plays a similar role in symmetric hypothesis 
testing as the relative entropy does in the asymmetric case. 

When an exponential bound is given on the decay of the a n 's, our interest lies in the 
following quantities [2"2"| [T§]: 

lim sup - log a n (T n ) < — r 1 , (2) 

n—*oo J 

lim sup - log a n (T n ) < -r 1 , (3) 
lim sup - log a n (T n ) < -r 1 . (4) 

n— >oo 71 I 



B(r\p\\a) := inf liminf — log/5 n (T n ) 



{T n } { n^oo n 

B(r\p\\a) := inf llimsup — \ogj3 n (T n 

{Tn} n^oo 71 

B(r\p\\a) := inf { lim - log j3 n (T„ 

{T n } I n-+oo n 



Based on the techniques developed in [lj and |20j, the identity 

B (rip 1 1 a) = B (rip II (?) = —b(r) , 6(r) := max (5) 

0<s<l 1 — S 

was proven for < r < 5 (pi || o~i) in the i.i.d. setting in [8] (where i? (r|p || a) < —b(r) was 
shown) and [T7] (where the inequality B (r\p\ \ a) > —b(r) was provided), thus establishing 
the theorem for the quantum Hoeffding bound. 

In this paper we will mainly consider the situation when p n and a n are the n-site 
restrictions of states p and a on an infinite spin chain, satisfying a certain factorization 
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property. Typical examples of such states are the global Gibbs states of translation- 
invariant finite-range interactions [10] and certain finitely correlated states [QUO]. Our 
main result is that ^ holds for such states when ip in Q is replaced with 

tfj(s) :=lim-logTrp^- s , s G [0, 1] . 

n n 

As a side-result, we recover the quantum Chernoff bound (already proven in JTU]) and a 
Stein-type upper bound for states of the above type. 



2 Preliminaries and upper bounds 

2.1 Error exponents: upper bounds 

Let A and B be nonnegative operators on a finite-dimensional Hilbert space 7i. It is easy 
to see that 

mm^{Tr A(I — T) + Tr BT} = - Tr(A + B) - - Tr \A - B\ , (6) 

and the minimum is attained at {A — B > 0}, the spectral projection of A — B corre- 
sponding to the positive part of the spectrum. The following was shown in [TJ : 

Lemma 2.1. Let A and B be nonnegative operators on a finite-dimensional Hilbert space 
H. Then 

-Tr(A + B) - ^Tt\A-B\ < Tr A S B^ S , < s < 1. (7) 

All along the paper we use the convention s := 0, s 6 R; in particular, A and B° 
are defined to be the support projections of A and B, respectively. With this convention 
s i — ► TrA s B 1 ~ s is a continuous function on R. In Appendix A we mention another 
representation of the quantity ^ given in |2"5] . 

Consider now the hypothesis testing problem described in the Introduction, and as- 
sume that we know a priori that the nth system is in the state p n with probability 
7r n G (0, 1) or in the state a n with probability 1 — 7r n . Then the Bayesian probability of 
an erroneous decision based on the test T n is 

P T n (Pn ■ On |7T„) := VT n « n (T n ) + (1 - 7T n ) f3 n {T n ) , 



and by applying Lemma 2.1 to A := ir n p n and B := (1 — 7T„)<t„ we get 

iWPn : K) := mm {P T „(Pn : <r n |vr n )} < <(1 - tt^)^ 5 Trp^<T^ s . (8) 

Note that the optimal test is of the form {7T n p n — (1 — 7r„)<7„ > 0}. Let 

^(s):=-logTrp^- s , sGR. (9) 
n 
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It is easy to see that ip n is a convex function on R for all n G N. Next, let 

ip(s) := limsup'0 n (s) , s G [0, 1] , 

n^oo 

and 

ip(a) := sup {as — ip(s)} , a G IR , (10) 

0<s<l 

be its conjugate (or polar) function. If 7r„ = ix is independent of n then (|sj) implies 

limsup - logP min (p n : a n \ir) < inf ip(s) = -<p(Q) , (11) 

n->oo U 0<s<l 

as it was pointed out in the i.i.d. case in pp. 

After this preparation, we prove the following: 

Lemma 2.2. Let p n and a n be density operators on a Hilbert space Ti, n for each n G N. 
Then for any a G IR 

limsup -log min {e~ na a n {T n ) + /3 n (T n )} < -<p(a) . (12) 
Moreover, for S n>a := {e~ na p n — b n > 0} we have 

limsup- log a n {S n> a) < ~W{a) - a] , 

n— >oo fl 

limsup- log Pn{S n ,a) < ~f{o) ■ 

Proof. Consider formula (8) with 7i n := ^np^, f° r a fixed a G IR. The optimal test is then 
{n n Pn - (1 - n n )cr n > 0} = S n>a , and by multiplying @ by 1 + e~ na we get 

nas tv, ^.1— s 
n i 



min {e-™a„(T n ) + (3 n (T n )} = e- na a n {S nA ) + (3 n (S nA ) < e~ nas Tr p s n a. 



which implies 

„ I C \ ^ na —nas rp ~s ±.1—8 QIC \ ^ —nas rp ~s ±.1— s 

for any s G [0, 1]. Thus the statement follows. □ 

Consider now an asymmetric hypothesis testing problem with an exponential bound 
on the decay of the a n 's. The relevant error exponents in this case are given in ([2]), ^ 
and Q. Obviously for any fixed r G IR 

B(r\p\\a) <B(r\p\\a) < B(r\p\\a) , (13) 

and all the above quantities are monotonically increasing functions of r. Note that for 
r < the choice T n := I n yields I? (r\p || a) = B (r\p\ \ a) = B (r\p\ \ a) = — oo, hence the 
above quantities are only interesting for r > 0. 



Lemma 2.2 yields the following corollary, that can be considered as the direct part of 



the theorem for the quantum Hoeffding bound: 
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Corollary 2.3. In the above setting 



B (r\p\\ a) < — sup (p(a) 



a: tp(a)—a>r 



The converse part of (11), inequality 



liminf - logP min (p n : a n \n) > inf ip(s) = -<f(0) 

n— >oo Tb 0<s<l 



(14) 



was shown in the i.i.d. setting in [20]. Inequalities (11) and (14) together give the theorem 



for the quantum Chernoff bound in the i.i.d. case. The main idea in proving the lower 
bound is to relate the problem to the classical hypothesis testing problem of a certain 
pair of classical probability measures associated to the original pair of quantum states. 
The same method was used to prove the lower bound in the theorem for the quantum 



Hoeffding bound in [17] . In Section 3.1 we follow (a slight modification of) this method to 



show that the converse part of inequality (12) in Lemma 2.2 holds (in the above general 



setting) if the functions ip n converge to a differentiable function on R. Apart from yielding 
the lower bound in the i.i.d. setting as a special case, there are examples for correlated 
states on a spin chain for which this criterion can be verified (see Example B.l). In 
general, however, differentiabilty seems to be rather difficult to prove, therefore we follow 



a different approach in Section 3.2 to obtain the converse part for a certain class of states 



on a spin chain, which we introduce in Section 2.2 



2.2 Spin chains and factorization property 

Let H be a finite-dimensional Hilbert space and A C B(TC) be a C*-subalgebra. For all 
k,l G Z, k < I, the finite-size algebra C[k$ '■= ®k<i<iA is naturally embedded into all 
C[ m ,n] with m < k,n > I, hence one can define C\ oc := UjfeieN^[M> which is a pre-C*- 
algebra with unit t. The spin chain C with one-site algebra A is the uniform closure 
of C\ oc . It is natural to consider C as the infinite tensor power of A, hence the notation 
C = ®kezA is also used. The right shift automorphism 7 is the unique extension of the 
maps j M : Cpy] -> X ^t A ®X. 

States on the spin chain are positive linear functionals on C that take the value 1 on 
t. A state io is translation-invariant if u o 7 = u holds. A translation-invariant state uo is 
uniquely determined by uj := {u> n }^ =1 , where u n is its restriction onto C[i, n ]- 

Definition 2.4. A translation-invariant state uj on the spin chain satisfies upper/lower 
factorization properties if there exists a positive constant 77 G IR such that 

uj < rju^-oofl] <8> u>[i i+CJ0 ) (upper factorization), (15) 
uj > r]^ 1 ^(-00,0] <8> W[i,+oo) (lower factorization). (16) 

For a fixed m G N any number n G N can be written in the form n = km + r with 
k,r G N, 1 < r < m, and consecutive applications of the above inequalities give 

^[i,n] < V k ^U m] ® ^[1.^] (upper factorization), (17) 
W[i, n ] >f l ^' m |®W[ 1]r ] (lower factorization). (18) 
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On the other hand, it is easily seen by taking n = 2m for an arbitrarily large m that 
inequalities ( Jl7| ) and (18) imply (15) and (16), respectively We will use the notation 
P £ «Sfect(C) if p n , n 6 N, are the n-site restrictions of a shift-invariant state p on C that 
satisfies the factorization properties above. 

Obviously, a product state u := cjf 00 satisfies both upper and lower factorization prop- 
erties. As it was shown in pU], finitely correlated states [6] satisfy upper factorization 
property, and in some special cases (e.g. locally faithful Markov states) also lower factor- 
ization property [10j[TT]. Another important class of states that satisfy both upper and 
lower factorization properties is that of the global Gibbs states of translation-invariant 
finite-range interactions. This result was also shown in [TO], based on the perturbation 
bounds developed in [T6] . 

Now let p. , B G <Sfact(C). Without loss of generality we can assume that they have 
the same factorization constant r\. If supp p m _L supp a m for some m then the upper 
factorization property (15) yields that supp p n _L supp a n for all n G N. Since in this case 
the hypothesis testing problem is trivial, we will always assume that supp p n and supp a n 
are not orthogonal to each other for any n, as far as the case p , a G 5f act (C) is concerned. 

Let ip n ,n G N, be as given in ([9]). The following lemma was shown in [ID] ; for readers' 
convenience and because we need an intermediate formula for later purposes, we give a 
detailed proof here. 

Lemma 2.5. The limit if)(s) := lim^oo if) n (s) exists for all s G [0,1], and if) is a convex 
and continuous function on [0,1]. 

Proof. Let m G N be fixed and n = km + r, 1 < r < m. Upper factorization property 
together with the operator monotonicity of the function x t— > x s , < s < 1, implies 



Trp^-<M^(Trp^) fc , 
where M = max{Tr p^o"^ 5 : 1 < r < m, 0<s<l}, and hence 



i>n(s) < — log M H log 7] H t/v, 

n n n 



Taking the lim sup in n, we obtain 



limsup^n(s) < Vm(s) + — log 7 ?- 



Taking the liminf in m then gives the existence of the limit. Being the pointwise limit of 
convex functions, ip is convex (and hence continuous in (0, 1)). 
In the same way as above, lower factorization property implies 



^m{ s ) log 77 < liminf ip(s) 

m n-*oo 



and we obtain the bound 



ip m (s) - —\ogi]< ip(s) < ip m {s) + — log// (19) 
m m 

for every m G N. This shows that ip is the uniform limit of the Vn' s > an d hence the 

continuity of if) follows. □ 
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3 A Chernoff-type theorem 



In this section we complement inequality (12) of Lemma 2.2 Our main interest is in the 



situation when p , a G <Sf act (C); we treat this case in Section 3.2 The main idea in this 
case is to use the lower factorization property to reduce the problem to the i.i.d. setting. 



In Section 3.1 we prove the converse inequality of (12) under the assumption that the 



functions ip n defined in ^ converge to a differentiable function ip on R. Even though 
this condition may seem to be rather abstract and difficult to verify in general, this 
approach has at least two merits. First, the converse inequality for the i.i.d. situation 



(needed in Section 3.2) follows as a special case. Second, it provides an extension from 
the i.i.d. situation that can be different from requiring the lower factorization property to 



hold, as we point out in Remark B.2 



3.1 Lower bound under differentiability 

Let ip n , n £ IM, be as given in ^ and assume that 

(al) the limit ip(s) := lim n ip n {s) exists as a real number for all sGR; 
(a2) ip is a differentiable function on R. 

Note that assumption (al) implies that suppp n cannot be orthogonal to suppa n , 
except for finitely many n's. Since all the ipnS are convex on R, ip is a convex function on 
IR as well. Let ip{s) := ip(l — s), s G R; then 

ip*(x) := sup{sx — ip(s)} and (ip)*(x) := sup{sx — ip(s)} = x + ip*(— x) 
seR seR 

are convex functions on IR with values in (— oo, +oo] (usually referred to as the Legendre- 
Fenchel transforms of ip an d ip)- Let (p(a) := sup 0<s<1 {as — ip(s)}, a G IR, as given in 



([10]); then <p(a) < ip*(a) and <p(a) = ip*(a) if and only if ^'(0) < a < ip'(l). 

We will use (a slight modification of) the method of [20] and [T7j together with the 
Gartner-Ellis theorem (see e.g. 0, Section 2.3]) to show the following: 



Theorem 3.1. Under the above assumptions 

1 



lim-log min { e - na a n (T n ) + (3 n (T n )} = -<p(a) 

n^oo n 0<T n <I n 



(20) 



for any a G IR with a ^ ip'{0), ip'(l)- Moreover, if ip* is continuous at ip'(0) and at ip'(l) 



then (20) holds for all a G IR. 



Proof. Thanks to Lemma 2.2 it suffices to prove that for any sequence of tests {T n } we 
have 

liminf-log{e-™a„(T n ) + f3 n (T n )} > -if (a). (21) 



n— >oo n 



Let 



(22) 
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be a decomposition of the densities p n and a n , where Pi, Qj are projections and Aj, rjj > 
for all i & I n , j & J n . Define the classical discrete positive measures on I n x J n by 



Pn(i,j) ■= XiTrPiQj , q n (i,j) ■= Vj^rPiQ 



j ■ 



Note that p n (I n x J n ) < 1 and q n (I n x J n ) < 1, and suppp n = suppg n holds for all n. 
Moreover, it is easy to see that Ty p n s a n ~ s = j)ei n x J n Pn{h j) s Qn{h j) 1-8 an d assump- 
tion 



(al) implies 



lim - \ogp n (I n x ,I n ) = i/)(0) and lim - logg n (I n x J n ) = 4>(l) ■ (23) 

n— »oo n— >oo 77, 

Let S n , a := {e- na /3 n - a n > 0}. Then 
e n (a) := min { e - na a n (T„) + (3 n (T n )} = e~ na Tr p n (I - S n , a ) + Tr a„S n , a 

0<T„<7 

= e" na Yl A * Tr ( J - ^a)^ " S n ,a) + ^ Tfe Tr S^QjS^ 

> e~ na Yl \Tr(I - S n , a )Pi(I - Sn^Qj + Y Vj Tl S n ,aQ,S n ,aP 

> Yl min{e- na A,, 77,} Tr [(J - S , n , a )P i (J - S n , a ) + S n , a PiS n ,a} Qj ■ 

{i,j)eI n XJ n 

Now by P, Lemma 9] we have 

-(/ - S n , a )P(I ~ S n , a ) + -S n ,aPS n , a > -P< , i G /„ . 

(This can also be seen from the operator convexity of the function Ja '■ X 1— > X* AX for a 
positive semidefinite A, which fact can be verified by a straightforward computation [22| 
Lemma 5].) As a consequence, 

2e n (a) > Y ^He~ na \,Vj}^PiQj = Y ™^{e~ na Pn{hj),qn{hj)} 
{i,j)ei n xJ n (i,i)eJnxJ n 

= e-"> n ({e"> n (z, j) < q n (i,j)}) + q n {{e~ na p n {i, j) > q n {i,j)}) 

= e~ na p n ({X n > -a}) + q n ({Y n > a}) , 

where 

X n {i,j) := - log , Y n (t,j) := - log 

are random variables with corresponding distribution measures /j„ := p n X~ x and 
/in '■= q n ° Y~ x . A straightforward computation shows that 



log / e ns:c df$ = n^„(l - s) , log / e nsx dp^ = 



s . 
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Under our assumptions (al) and (a2), the Gartner-Ellis theorem yields that 
liminf — logp„ ({X n > — a}) > — inf (■0)*(x) , 

n n x>~a 

liminf — log q n ({Y n > a}) > — mf-0*(x), 

and therefore 

liminf — log e n (a) > —m(a) := — min j inf ip*(x), a + inf (tjj)*(x) 

n fl I x>a x>—a 

We remark that the Gartner-Ellis theorem is usually stated for probability measures 
while our measures p n and q n are in general subnormalized. However, this case follows 
immediately from the standard version due to the existence of the limits in (23). 

Since mi xeR ijj*(x) = ip* (tp' (0)) = --0(0), we get mi x>a ip*(x) = -ip(0) for a < i>'(0), 
and in this case also ip(a) = —-0(0), hence m(a) < <p(a). Moreover, the same holds 
for a = if)'(0) if it is a continuity point of if)*. Similarly, mi x&R (ip)*(x) = (■0)* (if)' (0)) = 
—-0(0) = —"0(1) implies inf x> _ a (-0)*(x) = —"0(1) for —a < i/)'(0) = — i>'(l), hence m(a) < 
a — 0(1) = 93(a) when a > 0'(l). Again, continuity of (0)* at 0'(O) (i.e. continuity of 
■0* at 0'(1)) yields m(a) < <p(a) for a = ip'(l). The proof is finished by noting that for 
-0'(O) < a < 0'(1) we have hii x>a ip*(x) = ip*(a) = <p(a). □ 



In the proof of Theorem 3.4 we will need that (20) holds for every a G IR in the special 
case when both p and a are shift-invariant product states on a spin chain. To prove this, 
we first give the following lemma, which may be interesting by itself: 

Lemma 3.2. Let a and b positive elements in a C*-algebra A C 13(H), where 7i is a 
finite-dimensional Hilbert space. Let a = an d b = J^jeJ^jQj be their spectral 

decompositions (with all \i,r)j > 0), and let 0(s) := logTra s 6 1_s , s G R. Then the 
following are equivalent: 

(i) is an affine function on IR; 

(ii) there exists an s G IR such that ip"(s) = 0; 

(iii) there exist i\, . . . ,i m G /, ji, . . . ,j m G J and a 5 > such that 

p i k A Qj k 7^ and \ k = fiVik ' k = 1, . . . ,m , 

and that T,iei\{u,...,i m } p u E; e J\{j iy .., jm } Qj and EkLi P h v <?i* are mutually or- 
thogonal projections. 

Moreover, if A is isomorphic to the function algebra on a finite set X then the above are 
also equivalent to 

(iv) There exists a 5 > such that a(x) = 5b(x) for every x & X with a(x)6(x) 7^ 0. 
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Proof, (i) 
To see 

by 



11 



is obvious, and (iii) =>■ [(T)] is easy to check 



in 



define a function / and probability distributions p s , s G R, on 7 x J 



:= log A* - log 7ft 



A^TrP^- 



Note that the support of p s is the same for all s G R. A straightforward computation 
yields that 

ij ij 

which is if and only if / is constant on the support of p s , i.e. there exists a constant 
c G R such that logAj — logrft = c or equivalently A, = drjj with 5 := e c for all i,j such 
that Tt PiQj ytz o. Now if both Tr PiQj ^ and Tr Pi>Qj ^ then A, = 5rjj — Xy, hence 
i = i'. The same way Tr PiQj ^ and Tr PiQj' ^ imply j = j', and the rest of the 
statement follows. 

Finally, the equivalence of 



IV 



and (iii) in the commutative case is easy to see. □ 



Corollary 3.3. If p and a are shift-invariant product states then (20) holds for all a G R 



Proof. First note that assumptions (al) and (a2) are satisfied in this case. The spec- 
tral decompositions p\ = J2ieh anc ^ = ^j&Ji VjQj m d uce decompositions p n = 
J2iei? ^i^i an d °~n = Yujej* ^jQj as * n (22) for all n G N, where \ := A^ • . . . ■ A,;„, Pi : = 



P in and similarly for rjj and Qj. As a consequence, for the associated classical 



probabilities we have p n = pf n , q n = qf n . Then ip(s) = log Y^(i,j)ehx ,h P^(^ j) s Qi(h j) 
and 



1-8 



liminf - loge n (a) > liminf - log V mm{e-' na pf n (i, j), qf n (i,j)} . (24) 
n n n n - - 



Now we distinguish two cases. If ip is not affine then by (ii) of Lemma 3.2 we have ip"(s) > 



for all s G R, and this implies that ip'(s) is in the interior of {x G R : ip*(x) < +00} 
for all s G R. As a consequence, ip* is continuous at ^'(0) and at ip'(l), and this case is 



covered by Theorem 3.1 



Assume now that ip is affine; then by (iv) of Lemma 3.2 we have Pi(i,j) = 5qi(i,j) 
for all G S := supppi fl suppgi, hence 

^(s)=log ^2 ( s j)Y Q^-ih j) 1 ' 3 = s log 5 + log q^S) , 



and therefore 



On the other hand, 



-logqi(S), a<log5, 
— log qi (S) + a — log 5 , a > log 5 . 



(25) 



min{e-™pf n (i,j), qf n (i,j)} 



0, 

( -na 5 n q fn^j\ 



( h jJeS n , a<log5, 
H,j) e S n , a>logS. 
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and thus 



1 log V min{e-">f"(z, j), q? n (hj)} = \ 
n z — ' - - 

(i,j)e/™xj™ I 



log qxiS) 



a < log 5 . 



(26) 



log qi (S) — a + log 5 , a > log 5 . 
Formulas (|24|), (|25|) and (]26|) together give the desired statement. □ 



3.2 Lower bound under factorization 

Assume now that p , <? G «Sf act (C). Then the limit ^(s) := lim n 7/v(s) exists for all s G [0, 1] 



and ip is continuous on [0, 1], as was shown in Lemma 2.5 For each m G N let 

V? m (a) := max {as — 4>(s)} , a G IR 



be the polar function of ip m . The bound (19) implies that 

<fm(a) - — logi] < if (a) < f m (a) + — log 77 
m m 

for every a G IR and m G N, hence 9? is the uniform limit of the sequence {(f m }- 
Theorem 3.4. Let p. , a G 5f act (C). Then for any a G IR 



(27) 



lim-log min {e~ na a n (T n ) + (3 n (T n )} 

n^oo n 0<T n <I n 



■<PW- 



(28) 



Proof. Due to Lemma 2.2 it suffices to prove that (21) holds for any sequence of tests 



{T n }. We can assume that p and a have the same factorization constant 77. Let m G N 
be fixed and write n > m in the form n = (k — l)m + r with 1 < r < m. With 
7 := min{l, e ( - m ~ r ^ a ; < r < m} we have 



~ na a n (T n ) + /3 n (T„) > 7 (e- kma a n (T n ) + (3 n {T n )) 



7 (e kma Otkm{T n ® I[n+l,km]) + f3km(T n <8> /[n+l,fem])) 

mm {e" fc ™ Tr[pf (/ - T)] + Tr[^T] } 



777 



-(fc-i) 



(29) 



where we used lower factorization property in the last step. Let 



(s) := logTr/3^<x m 



mipmi 8 ) an d $m( a ) := max { as — ^m(s)} = mip m (a/m) . 

0<s<l 



By Corollary 3.3 we have 



hmmf-logmm^e-^Trtpra-T)] + Tr[&*T\} > -$ 



m (ma). 



Combining it with (29) we obtain 



hminf-log{e- ria « n (r n )+/5 n (T n )} > --log 77 

n-*ao n m 



1 



-$ m (ma) 



1 



m m 

> \ogr]-(p(a) log 77, 

m m 



log 77 - (p m (a) 



where we used (27) in the last inequality. Taking the limit in m gives the assertion. □ 
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3.3 Some remarks 



Note that equations (20) and (28) can be reformulated as 



lim - logP min (p n : a n \ir r , 



— max{(p(a), (p(a) — a} 



-ip{a) 



if a > . 



— (p(a) + a if a < 



(30) 



with n n := 1 ^ e -1 a , therefore giving an extension of the theorem for the Chernoff bound to 
a setting when the prior probabilities are not constant, but depend on n in the given way. 



From Lemma 2.2 we get that (30) holds whenever a G A(p } cr), where A (p, a) denotes 



the set of all a G R for which inequality (21) is satisfied. In particular, if G A(p,a 



then we recover the theorem for the Chernoff bound [21 [ID]. This is the case e.g. when 



p,a G «Sf act (C) (since A(p,a) = R by Theorem 3.4) and when assumptions (al) and (a2) 



are satisfied and ^ d + ^{0) and ^ d~tp{l) (since A(p,a) D R\ {9+^(0), b Y 



Theorem 3.1 ) 



Alternatively, one can interpret Theorems 3A and 3^ as the theorem for the Chernoff 
bound in the setting when hypothesis testing is performed between the states a n and the 



unnormalized states e na p n . Indeed, in the setting of Section 3.1 or Section 3.2 we have 



1 



iP a (s) := hm-logTr (e- na p n ) s a 1 ' 8 = -{as - 4(3)} , s G [0, 1] 



n n 



and 



lim — log P n 



n n 



Pn ■ cr n FJ 



min ip a (s) 

0<s<l 



for any n G (0, 1) and a G A (p, a). 



4 The HoefFding bound and related exponents 

Our main goal in this section is to derive the theorem for the Hoeffding bound in the 



settings of Sections 3.1 and 3.2 To treat the two settings in a unified way, we derive all 
our results under the following assumptions: 



(Al) The limit if)(s) := lim^oo i[) n (s) exists as a real number for all s G [0,1] and if) is 
continuous on [0, 1]. 

(A2) The inequality 



lim - log min {e" na « n (T n ) + (3 n (T n )} > -<p(a) 

n^oo n 0<T n <I n 



(31) 



holds for all a < d if>(l) (the left derivative of ip a t 1), except possibly for finitely 



many values of a, where is given in (10). 



Though assumptions (Al) and (A2) are admittedly rather artificial, they have the 
merits that they are satisfied in the cases of our interest on the one hand (see Section [3]) , 
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and on the other hand they give the minimal requirements under which the results of this 
section are valid, thus providing a better view on the logical relations among our results. 
We begin by introducing ip(s) := ip(l — s), s G [0, 1] and its polar function 

<p(a) := max {as — i/j(s)} = a + <p(—a), a G R. 

0<s<l 

We also define 

<p(a) := <p(—a) = p(a) — a , a G R . (32) 



In Appendix C we give an illustration of the above definitions and the properties listed 
in the following: 

Lemma 4.1. The functions tp and (p have the following properties: 

(i) (p is convex, continuous, and increasing on R. Moreover, it is constant —ip(0) on the 
interval (— oo, d + tp(0)] and strictly increasing on (d + ip(0), +oo). 

(ii) <p is convex, continuous, and decreasing on R. Moreover, it is strictly decreasing on 
the interval (— oo, d~ip(l)) and is constant — on the interval [d~i/)(l), +oo). 

Proof. All properties follow immediately from the very definitions of <p and <p, except for 
strict monotonicity. We only prove it for tp, as the proof for ip is completely similar. Note 
that 

p(a) = <p(a) — a = max {a(s — 1) — ip(s)}, 

0<s<l 

and let s a := argmax 0<s<1 {a(s — 1) — ip(s)}. It follows from a(l — 1) — ^(1) = — ^(1) 
that if a is such that p[a) — a > — (i.e. a < d~i[)(l)) then s a < 1. Hence for b < a 

<p{a) - a = a(s a - 1) - ip(s a ) < b(s a - 1) - ip(s a ) < <p(b) - b. 

□ 

Remark 4.2. Due to the above listed properties, one can extend <p and <p to continuous 
and monotonic functions on RU{+oo} by defining p(+oo) := +oo and <^>(+oo) := —-^(1). 

Remark 4.3. Note that ip n {s) < for all s G [0, 1], and n G N, hence the same holds 
for ip. If supp p n < supp<j n then ipniX) = z log Tr [p n supp a n ] = 0, and a straightforward 
computation shows that 

d~1p n (l) = -S (p n || <7 n ) , 

n 

where S (p n \ \ a n ) := Tr p n (log p n — log a n ) is the relative entropy of the states p n and a n . 
Convexity of ijj n implies 

Ms) >V>n(l) + (s-l)d-^n(l) = (s-l)^S(p n \\a n ) , s G [0,1]. (33) 

Assume now that supp p n < supp a n holds for all large enough n, and that the mean 
relative entropy 

S M (p || a) := lim -S (p n \ \ a n ) 

n—*oo n 
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exists. (Note that if p, a G <Sf act (C) then if suppp m C suppa m for some m e N then it 
also holds for all n e N, and the mean relative entropy exists even if one requires only 
the upper factorization property to hold [131 Theorem 2.1].) Then = 0, and taking 
the limit in (33) yields 



therefore 



^00 > (s-l)S u (p\\a) , se [0,1] 



d~ip(l) <S M (p\\a) ■ 



(34) 



Similarly, if we replace the condition supp p n < supp a n with supp p n > supp a n in the 
above argument then we get ip(0) = and d + ip(0) > —Sm (c || p)- 

Note that if p n = pf n and a n = af n , neN, with supp p\ A supp b\ ^ and supp p\ ^ 
suppax, then d~ip{l) is finite while Sm (p || a) = +oo, hence (34) cannot be expected to 
hold as an equality in general. In Appendix B we show examples for correlated states p, a 
on a spin chain for which d'^iX) = Sm {p\ \ <?) can be shown by an explicit computation. 

Lemma 4.4. Let a < d~tp(l). Then for any sequence of tests {T n } satisfying 

limsup - log ct n (T n ) < -{(p(a) - a}, 

n^oo Tl 



we have 



liminf- log /3 n (T n ) > -<p(a). 

n— >oo TL 



Proof. We follow the same argument as in [T7j. For any b satisfying (31 ) we have 



-<p(b) < liminf - log {e- nb a n (T n ) + (3 n {T n )} 



< max <{ liminf — log (3 n (T n ), —b + limsup — loga n (T n } 



< max < liminf — log j3 n (T n ), —b — (fi(a) + a 



If a < b < d V(l)> then — <p(a) + a < — ip(b) + b by Lemma 4.1 (ii), hence 



Continuity of <p then yields 



liminf- log f3 n (T n ) > -<p(b) . 

n— >oo Tl 



liminf- log (3 n (T n ) > -(p(a). 

n— >oo Tl 



□ 



Lemmas 4.4 and 2.2 give the following: 
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Corollary 4.5. For a < d and S n , a := {e na p n — a n > 0} we have 

lim -logP n (S n>a ) = -(f(a). 

n— >oo 72, 

Remark 4.6. Recall that ip is strictly increasing on the interval [d + ip(0), +oo). An 
obvious modification of the above proof then yields that for any a > d + if)(0) and any 
sequence of tests {T n } satisfying 

limsup - log p n (T n ) < -ip(a) 

n— >oo Tl 



we have 



In particular, 



for any a > d + ip(0). 



liminf-loga n (T n ) > -{(p(a) - a}. 

n— >oo Tl 



lim - log a n (S n>a ) = -{(p(a) - a} 

n— >oo Tl 



Note that d + ip(0) < d due to the convexity of ip, and the interval 

I t ■= {a G R : 9+^(0) < a < d~ip(l)} 



is nonempty if and only if the graph of tp is not a straight line segment. Corollary 4.5 and 



Remark 4J3 give the following: 
Theorem 4.7. For a G 1^ we have 



lim -logOn^o) = -{^(o) - a}, 

n— >oo -n, 

lim -log^n^a) = -</?(a). 

n^oo Tl 



Now we are in a position to prove our main result. 
Theorem 4.8. For any r G R we have 

B(r\p\\a) = B(r\p\\a)=B(r\p\\a) = — sup . 

o<s<i 1 — s 

For r < —"0(1) all the above quantities are equal to — oo; for r > we have 

ui I -I I -A -sr-tp(s) 

B (r\p \\a) — — max = — (p(a r ) , 

0<s<l 1 — S 

where a r is a unique number in (— oo, d~ip(l)] such that (p(a r ) = r. 
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Proof. First assume that r < —-0(1). A straightforward computation shows that 



-sr-ip(s) 
1-s 



tends to +00 as s / 1, hence sup 0<s<1 



-sr-ip(s) 
1-s 



-00. Let T n := I — suppa n . Then 



^loga n (T n ) = ^(1) by definition, hence limsup n ^ loga n (T n ) = -0(1) < — r and obvi- 
ously P n (T n ) = for all n. Hence (by using the convention logO := —00) we get 



B (r\p\\ a) < —00 = — sup 

0<s<l 



— sr — if)(s) 



and the inequalities in (13) give the desired statement. 

Now if r > —"0(1) then the properties of (p listed in (ii) of Lemma |4~T guarantee the 
existence of a unique a r < d~ip(l) such that <p(a r ) = r (see Figure [2] in Appendix C for 
an illustration). Note that if d + ip(l) = +00 then a r = +00 and we use the conventions 
of Remark 4.2 If limsup n - loga n (T„) < — r then there exists a b < a r such that for all 
b < a < a r we have 



limsup -loga n (T n ) < -{(p(a) - a} < -{<p(a r ) - a r } 
n n 



-r . 



By Lemma 4.4 



liminf-log/3 n (T n ) > -ip(a) 
n n 



and by taking the limit a /* a r we obtain 



liminf - log/3 n (T n ) > -ip(a r ) . 
n n 



Hence 



B (r\p\ I a) > —ip(a r ) . 
On the other hand, by Lemma 2.2 and Corollary 4.5| we have for any a < a r 



limsup-loga n (S , „ ia ) < -{(p(a) - a} < -{ip(a r ) - a r } 

n Tl 

lim-log/3 n (5' n)a ) = -(f(a), 
n 

hence B (r\p\ \ a) < —ip(a). Now taking a /* a r we get 

B (r\p\ \ a) < —ip(a r ) . 



Taking the inequalities in (13) into account, we have 

—(p(a r ) < B(r\p\\ a) < B (r\p\\ a) < B (r\p\\ a) < —ip(a r ) . 

To prove the last identity assume first that r > Then a r < d~ip{l) and s r < 1, 

where s r := argmax 0<s<1 {a r s — i/j(s)}. Thus we have 



r = (p(a r ) — a r = a r s r — ip(s r ) — a r > a r s — ip( s ) ~ a r 



< s < 1 
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hence a r > 



-rj^aj £ Qr an y o < s < 1 with equality for s = s r . Then 

— sr — ip(s) 



<p(a r ) = a r s r — ip(s r ) > a r s — ip(s) > 



1-s 



< s < 1 



(35) 



and equality holds for s = s r . 

Now if r = —ip(l) then a r = d~ip(l) and <£>(a r ) = d~ip(l) — ip(l), and one can easily 
see that 



lim 



1 - S 



If d = +oo then this gives the desired identity immediately If d < +oo 

then the statement follows from the fact that inequalities in (35) are still valid (with 
s r = 1). □ 

One can get a certain interpolation between the setting of Stein's lemma and the 
theorem for the Hoeffding bound by requiring that the a n 's converge to zero exponentially, 
but without constraint on the value of the exponent. The corresponding exponents for 
the (3nS are B (0|p || a) , B (0|p|| a) and B (0|p || a). We have the following: 

Proposition 4.9. If ip(l) = then 

B(0\p\\a) = B(0\p\\a) = B(0\p\\a) = -d~^{l) . 

Moreover, if p and a satisfy the upper factorization property and supp p n < supp a n for 
all n G N then 

B(Q\p\\3) > -S u (p\\(j) ■ 



Proof. The first statement is a special case of Theorem 4.8 To see this, take r = = 
0; then a r = d~ip(l) and ip(a r ) = d~ip(l). The second statement follows from (34). □ 

In studying Stein's lemma, one is interested in the exponents B(p || a), B_(p 1 1 <?) and 
B[p 1 1 a), where 

1 



4.8 



B{p\\ a) := inf < lim - log (3 n (T n ) lim a n (T n ) = 

and B(p\\a) and -B(p||c?) are defined similarly, by taking lim inf - log j3 n (T n ) and 
limsup^^ i log (3 n {T n ). Obviously, B_(p\\a) < B(p\\a) < B(p\\a), and Theorem 
has the following consequence: 

Proposition 4.10. If ^(1) < then 

B(p\\ a) = B(p\\ a) = B(p\\ a) = — oo = -S M (p || o) ■ 



If V(l) = then 



B(p\\a) < -d~i)(l) 
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Proof. If tp(l) < then there exists an iV 6 N such that Tr p n [supp cr„] < 1 for all 
n > N, i.e. suppp n ^ supp<T n and hence S(p n \\a n ) = +oo for all n > N, implying 
Sm{p\\°') — +oo. The rest of the statements follow immediately from the fact that 



B(p\\cr) < B (0\p\ \ a), and that i?(0|p||(x) = — oo when ?/>(!) < by Theorem 4.8 and 



B (0|p || cr) = —d when = by Proposition 4.9 □ 



Note that when p = pf 00 and a = af 00 are product states with supp p\ < supp d~i 
then = and d~ip(l) = S (pi\ \ o"i), and we get back the well-known formula for the 
direct part of Stein's lemma. 



5 Concluding remarks 



We have studied various error exponents, including the Chernoff and Hoeffding bounds 
in a binary (asymptotic) hypothesis testing problem. While following a rather general 
formulation, the main applicability of our results is the hypothesis testing problem for 
two states on a spin chain, both satisfying the factorization properties given in Definition 
2.4 That the study of such states is sufficiently well motivated was established in 



where we have shown that the factorization properties are satisfied by the global Gibbs 
states of translation-invariant finite-range interactions. Other important examples for 
states to which our results may be applicable are provided by the Markov-type class of 
finitely correlated states [6] (see e.g. Example B.l and [10]). While finitely correlated 

[10J, it is an open question at the 



states always satisfy the upper factorization property 
moment to find necessary and sufficient conditions for the lower factorization property to 
hold. As Remark |B.2 suggests, it may be possible to prove the validity of assumptions 
(al) and |(a2)| for finitely correlated states even if the lower factorization property fails to 
hold. Similar conditions to our factorization properties were used at various places in the 
literature. Probably the closest to our factorization properties is the *-mixing condition 
(see e.g. [I] and references therein). However, the relation among these conditions and 
the factorization properties in the quantum setting is an open question at the moment. 



Our main tool in deriving the upper bounds in Lemma 2.2 was the powerful trace 
inequality ^ discovered in [1] , that was successfully applied to give the Chernoff [1] and 
Hoeffding j8] upper bounds in the i.i.d. case. Our Corollary 2.3 is an extension of [H 



Theorem 1] to the very general setting of Section 2.1, and a slight simplification as well, 
as the tests only depend on the parameter a, and not on s as in [8]. 



The main idea in deriving the lower bound (21 ) in Section 3.1 is to relate the quantum 



problem to a classical hypothesis testing problem by the method of [20] and then use large 
deviation techniques to treat the classical problem. This approach was used in the i.i.d. 
case to derive the quantum Chernoff [20] and Hoeffding [T7] lower bounds. In [17] the two 
states were implicitly assumed to have the same support, which assumption can easily be 
removed by restricting the classical probability distributions onto the intersection of their 
supports; this approach was followed in [2]. In Section 3.1 we have followed a different 



way to circumvent the restriction of equivalent supports, by slightly modifying the way to 
assign classical measures to the original states. In the non-i.i.d. case it is a natural choice 
to use the Gartner-Ellis theorem to establish the lower bound in the classical hypothesis 



testing problem, and the differentiability condition in assumption (a2) is essentially the 
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requirement of the differentiability of the logarithmic moment generating function in the 
Gartner-Ellis theorem. As we argue in Section ^3 the main results of Section[3j Theorems 



3.1 and 3.4 are essentially giving the theorem for the Chernoff bound in an appropriate 



setting. The fact that Corollary 3.3 is true for all real numbers a seems to be new even 
in the i.i.d. setting. The exclusion of the cases a = d + tp(0) and a = d~tp(l) in Theorem 



3.1| is strongly related to the possibility of a pathological situation when the graph of ip 
becomes a straight line, and could possibly be removed if a similar characterization to 
that in Lemma [3.21 was available also in the non-i.i.d. case. 

It is well-known in the information spectrum approach that the limits of the quantities 
- log a n (S nta ) and - log /3 n (S nta ) are strongly related to the theorem for the Hoeffding 
bound, as was emphasized e.g. in [7j and [19]. In Lemma 4.4 we follow the way of [TF] 



to derive these limits from the Chernoff-type theorems of Section [3j Theorem 4.7 



was 



stated as a conjecture in [T7] and it was proven shortly after in [18] in the i.i.d. setting for 
the values of a between — 5m (ci \ \ Pi) and 5m (pi || c^); this coincides with our 1^ when 
supp pi = supp 01 is assumed in the i.i.d. setting. The importance of the above limits are 



clearly shown by the fact that the results of Theorem 4.8 and Propositions 4.9 and 4.10 



hold true whenever Corollary 4.5 is true (here we benefit from the fact that Lemma 2.2 



is unconditionally true in the most general setting, showing again the power of inequality 
0)- 

The interpretation of Q(p, a) := mino^^x Tr p s a l s as a distinguishability measure 
on the state space of a finite dimensional quantum system was investigated in [T] , where 
a detailed analysis of its properties and its relation to other measures (like fidelity, trace 
distance and relative entropy) was given. Here we would like to stress the importance of 
its negative logarithmic version 



C (p, a) := — min \ogTr p s a 

0<s<l 



1-s 



- log Q(p, a) 



It is jointly convex in its variables (due to Lieb's concavity theorem), monotonic decreasing 
under 2-positive trace-preserving maps [231 El], and easily seen to be strictly positive, 
thus sharing some of the most important properties of relative entropy. Moreover, if 
p , a e 5f ac t(C) then the uniform convergence established in Lemma 2.5 shows that the 
limit 

C M (p, a) := lim -C (p n , a n ) (36) 

n^oo TL 

exists (and coincides with ip(0)), further extending the analogy with the relative entropy. 



Theorem 3.4 for the Chernoff bound gives that 



lim - log min {a n (T n ) + (3 n (T n )} = -C M (p, a) , 

n-*oc n 0<T n <I 

thus giving an operational interpretation to the mean Chernoff distance, and showing that 
it plays exactly the same role in the symmetric setting of the theorem for the Chernoff 
bound as the mean relative entropy plays in the asymmetric setting of Stein's lemma. 



Obviously, the asymptotic quantity (36) is still jointly concave and monotonic decreas- 



ing under 2-positive trace-preserving maps; it is not clear, however, whether the strict 
positivity property is preserved under taking the limit. 
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Appendix A 



Note that Lemma 2.1 becomes trivial when B(TC) is replaced with a commutative C*- 
algebra. Indeed, in this case elements of the algebra are functions on some compact space 
X, and if /, g are non-negative functions then 

\ (f(x) + g(x) - \f(x) - g(x)\) = mm{f(x),g(x)} < /(x)^(x) 1 ^ , s G [0, 1] , 

and the trace can be replaced with integration with respect to an arbitrary positive 
measure. The minimum of two nonnegative operators A,Be B(7i) might be defined as 
the unique self-adjoint X G B(H) for which 

X < A, X < B and Y < A, Y < B ^ Y < X 

holds. Note, however, that such an operator does not exist in general, even when A and 
B commute with each other. On the other hand, the following is true: 

Proposition A.l. For any nonnegative operators A,B G B(7i), we have 

max{TrX : X < A, X < B} < TiA'B 1 - 3 , s G [0, 1] . 



The proof follows immediately from Lemma 2.1 and the following lemma, which is a 
special case of the duality theorem in multiple hypothesis testing [25], E] . Since the proof 
of the binary case is immediate, we include it for readers' convenience. 

Lemma A. 2. For any nonnegative operators A, G B(7i), we have 

maxiTr X : X < A, X < B\ = min {Tr Ail — T) + Tr BT\ 

0<T<I 

= Tr -(A + B- \A-B\) . 

Proof. Suppose that X < A, X < B. Then X = X* and for any operator T satisfying 
< T < / we have 

TrX = TrX(7 — T) + Tr AT < Tr A(I - T) + Tr BT. 

Conversely, let S := {A- B > 0} and X := A(I - S) + BS = \{A + B - \A-B\). Then 

X = A- (A-B)+ < A, X = B-(A-B)_ < B, 

where (A — B) + and (A — B)_ denote the positive and the negative parts of A — B, 
respectively. □ 

Loosely speaking, the above shows that as long as Tr is taken, | (A + B — \ A — B\) can 
be considered as the minimum of A and B. Note, however, that ^ (A + B — \A — B\) need 
not even be positive semidefinite when both A and B are positive semidefinite. A simple 
counterexample is given by A := \a)(a\ and B := \b)(b\ with a = (l,i) and b = (1, 1). 
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Appendix B 



Unfortunately, assumptions (al) and |(a2)] in Section [3~T seem to be difficult to verify in 
general correlated cases. Below we show a specific example for a pair of correlated states 



on a spin chain for which assumptions (al) and (a2) can directly be verified. Note that the 



example is non-classical in the sense that the local densities p n and a n need not commute 
with each other. However, both p and a exhibit only classical correlations among the sites 
of the chain, i.e. all local densities p n , a n , n G N, are separable. 

Example B.l. Let T = {T xy } Xtye x and S = {S xy \ x ^x be irreducible stochastic matrices 
with corresponding faithful stationary distributions r = {r x } x£ x and p = {p x } x ^x on 
some finite set X . Moreover, let {& xy \ x ^x and {(p xy } x , y £x be sets of states on a finite- 
dimensional C*-algebra A and Q x := ^2 y T xy $ xy , <& x := J2 y S xy ^p xy . The local states 



Pn 



^ ^ r xi {T XlX2 q 9 XlX2 ) ( 

xi,...,x„£X 



(T Xn _ lXn r & Xn _ lXn ^J <g> Q Xn 
{S Xn _ 1Xn ip Xn _ lXri ^ <8> 



xi,...,x„GX 



are easily seen to extend to translation-invariant states p and a on the spin chain C = 
t&kez-A. (Actually, p and a are ergodic finitely correlated states with a commutative 
auxiliary algebra in their generating triples; see [6] and also [TO] for details.) 

Let us assume that there exists a set of non-zero projections {P x }^x in A with or- 
thogonal ranges such that 



supp § xy V supp (p xy < P x and also supp d xy A supp (p xy ^ , x, y G X. (37) 



Then 



= (a^Q^r 1 ^)) 
for every s G R with 

b(s) x : Tr (-).;>!•: 5 and Q(.s),,, := TlS^Tr 



as 



rV~ s 

' xfx 1 



xyrxy 



Now if Q(s) is irreducible for some (and hence for all) s G IR then by the Perron- Frobenius 
theorem we have 

ip(s) = lim -log(a(s),Q(s)" _1 o(s)) = logr(s) , sGR, 

n— >oo n 

where r(s) is the spectral radius of Q(s) (see e.g. [5j Theorem 3.1.1]). Being a simple 



eigenvalue, the function s \—> r(s) is smooth (cf. [IS]), and so is ip, hence assumptions (al) 



and (a2) are satisfied in this case. 
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Assume now that supp p n < supp a n for some n > 2. This is easily seen to be equivalent 
to the conditions 

T xy = if S xy = 0, (38) 
supp "& xy < supp (p xy if T xy >0, (39) 

and hence is independent of the value of n. (Note that the first condition states that the 
classical Markov chain generated by T and r is absolutely continuous with respect to that 
generated by S and p.) It is easily seen that in this case Q(s) is irreducible for every s G R, 
hence we can apply the above argument and obtain ip(s) = log r(s), s 6 R. Simplicity 
of r(s) as an eigenvalue of Q(s) yields that one can choose the corresponding Perron- 
Frobenius (left) eigenvectors £(s) = {£ x ( s )}xe.x to form a strictly positive probability 
distribution for all s G R such that the function s h- > £(s) is smooth (cf. [IS])- Let 
e := (1, . . . , 1) be the identity vector. Using the facts that r(s) = (£(s)Q(s),e), that 
(£'(s),e) = due to the fact that (£(s),e) = 1 for all s, and that Q(l) = T, = 
r, r(T) = 1, we obtain 

1^(1) = 7§y = <?(i)>G(i)e> + (e(i),Q'(i)e) 



TxT X yS ($xy 1 1 ^xy) • 



xy 



A straightforward computation then shows that the latter expression is exactly the mean 
relative entropy Syi{p\\o). Similarly, if we impose the condition supp p n > suppo- ra for 
some (and hence for all) n > 2 then we obtain ip'(0) = —Su {p \ \ p)- 

Remark B.2. In the above construction, let A be isomorphic to the function algebra 
on some finite set X, and let d xy := (p xy := \s x \ (the indicator function of {x}) for all 
x, y G X. Then p n and a n are the densities of the n-site restrictions of the Markov 



measures p and v generated by (T, r) and (S,p), respectively, and the conditions (37) and 



(39) are automatically satisfied. 

Now it is easy to see that p and v satisfy the lower factorization property if and only 
T and R are entrywise strictly positive matrices, which condition is sufficient but not 



necessary for (38) to hold. Thus the above construction provides examples for situations 



when the lower factorization property is not satisfied while assumptions (al) and (a2) 
hold true. 

Example B.3. Let p and a be the global Gibbs states of translation-invariant finite- 
range interactions $ and respectively. The local Gibbs state p^ for $ has the density 
e~ Hn ^> / Tr e~ Hn ^\ where H n (<&) is the local Hamiltonian of $ inside [l,n]; the local 
Gibbs state cr^ is defined similarly for \I>. Since there is a constant A > 1 (independent 
of n) such that X~ 1 p n < p^ < Ap n and A" 1 ^ < < Xa n (see [HI Lemma 2.1]), the 
function ip is written as 

Ms) = lim - log Tr e --#»(*) e -(i-»)#«W _ s p($) - (l- s )P(tf), s G R, 

n— >oo Tl 
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where P($) ■= lim^oo - logTre Hn ^\ the pressure of We have ^(O) = ^(1) = 0, and 
the Golden-Thompson trace inequality yields 

ip(s) > f(s) := P(s$ + (1 - s)tf) - sP($) - (1 - s)P(tf), s G R. 

Here equality cannot hold in general as is immediately seen in the case of product states. 
Note that £(s) corresponds to one of the candidates proposed in (22] to obtain the Quan- 
tum Hoeffding bound. By (TTJ Theorem 2.4 and Lemma 2.3] we notice that £ is differen- 
tiable on R and moreover 

£'(0) = aP*($-^) -P($) + P(tf) 

= -<7(4*_») - P($) + s(<r) - a(^) 
= s(<r) - <r(A*) - P($) 
= -SWHIp)> 

where A$ is the mean energy of $ and s(<r) is the mean entropy of o. Similarly £'(1) = 
Sm{p\W)- Hence we have i/j'(0) = — Sm{p\\p) and ip'iX) = Sm(p\W) as long as if) is 
different iable at 0,1. In particular, when H n (<&) and H n (ty) commute for all n, it is 
obvious that tj) = £. But this situation is essentially classical since for all n the densities 
of p n and a n commute, too. 

Appendix C 




Figure 1: the definitions of (p and <p with a typical ip 
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0(a) = (f(a) — a 




Figure 2: the graphs of y? and ip in a typical case 
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